fix deadlock in processWriteQueue #778
Conversation
LGTM
the way to update a library is now like so:
looks like there are definitely some useful bugfixes in the gocql library we should have.
tracing.Failure(span)
tracing.Error(span, err)
errmetrics.Inc(err)
return nil, err
for better or worse, it used to be by design that we would continue here and just use as much data as we could get back. remember the days before @replay started, and we would see charts with missing data here and there because certain chunks timed out? we would see the timeout errors on our MT dashboard, and MT always tried to make the most out of it.
anyway, the importance of "correct or error" (as opposed to "best effort") is becoming more and more clear, as we see with some customers and their alerting queries. so 👍
@@ -290,6 +291,7 @@ func NewCassandraStore(addrs, keyspace, consistency, CaPath, Username, Password,
	omitReadTimeout: time.Duration(omitReadTimeout) * time.Second,
	ttlTables: ttlTables,
	tracer: opentracing.NoopTracer{},
	timeout: cluster.Timeout,
note that we default to 1000 ms, which seems too aggressive. looking at some customer dashboards, some customers are seeing multi-second query durations (cassGetExecDuration)
this timeout is only used for writes. reads just use the existing requestContext, which we don't currently set a timeout on and instead just rely on the context being canceled, e.g. when the client disconnects.
queries will fail if they run longer than cluster.Timeout.
metrictank/vendor/github.com/gocql/gocql/conn.go
Lines 612 to 658 in ca4a891
var timeoutCh <-chan time.Time
if c.timeout > 0 {
	if call.timer == nil {
		call.timer = time.NewTimer(0)
		<-call.timer.C
	} else {
		if !call.timer.Stop() {
			select {
			case <-call.timer.C:
			default:
			}
		}
	}
	call.timer.Reset(c.timeout)
	timeoutCh = call.timer.C
}

var ctxDone <-chan struct{}
if ctx != nil {
	ctxDone = ctx.Done()
}

select {
case err := <-call.resp:
	close(call.timeout)
	if err != nil {
		if !c.Closed() {
			// if the connection is closed then we cant release the stream,
			// this is because the request is still outstanding and we have
			// been handed another error from another stream which caused the
			// connection to close.
			c.releaseStream(stream)
		}
		return nil, err
	}
case <-timeoutCh:
	close(call.timeout)
	c.handleTimeout()
	return nil, ErrTimeoutNoResponse
case <-ctxDone:
	close(call.timeout)
	return nil, ctx.Err()
case <-c.quit:
	return nil, ErrConnectionClosed
}
So if we have queries that are running longer than this, then it will be because they are blocked waiting for a stream to become available.
ah yes..
in insertChunk we call context.WithTimeout(context.Background(), c.timeout)
doesn't that mean that in the select either of the 2 operations (between lines 648 and 655 in the code shown above) can succeed at practically the same time? one comes from the timeout set from c.timeout, which is the same timeout value we specified in the config, whereas ctxDone fires when the context's deadline triggers, which is a deadline we also set based on the same timeout?
IOW i don't see the use for the context on the insert queries since they seem to achieve the same thing as the already set up timeout mechanism. am i missing something?
c.timeout is only measured from after the request is sent to cassandra.
the context.Deadline is set when the request is constructed, so includes the time the request sits waiting to be sent.
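to make that concrete, here's a small self-contained toy sketch (none of this is the actual gocql or metrictank code; doWrite and the simulated delays are made up purely for illustration). the context deadline starts ticking when the request is constructed, so it also covers the time spent queued, while the per-call timer only starts once the request is handed to the connection:

package main

import (
	"context"
	"errors"
	"fmt"
	"time"
)

var errTimeoutNoResponse = errors.New("no response received before timeout")

// doWrite simulates one write: the context deadline covers queueing plus the
// round trip, while the per-call timer (like gocql's c.timeout) only covers
// the round trip itself.
func doWrite(timeout, queueDelay, respDelay time.Duration) error {
	ctx, cancel := context.WithTimeout(context.Background(), timeout)
	defer cancel()

	// simulate the request sitting in the write queue waiting for a stream
	select {
	case <-time.After(queueDelay):
	case <-ctx.Done():
		return ctx.Err() // deadline exceeded before the request was even sent
	}

	// only now does the per-call timer start
	callTimer := time.NewTimer(timeout)
	defer callTimer.Stop()

	select {
	case <-time.After(respDelay): // stand-in for the response arriving
		return nil
	case <-callTimer.C:
		return errTimeoutNoResponse
	case <-ctx.Done():
		return ctx.Err()
	}
}

func main() {
	// queued for 800ms, response after 500ms, timeout 1s: the per-call timer
	// would be satisfied, but the context deadline (which includes the 800ms
	// of queueing) fires first and we get context.DeadlineExceeded.
	fmt.Println(doWrite(time.Second, 800*time.Millisecond, 500*time.Millisecond))
}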
let's rename this field to writeTimeout to avoid confusion, and a note explaining what you said in your above comment would also be a big help to anyone reading the code.
let's rename this field to writeTimeout to avoid confusion
No. CassandraStore.timeout is assigned the same value as the gocql cluster.Timeout, which is for reads and writes.
Variables should be named based on the information they contain, not on how they are used.
if err == context.Canceled || err == context.DeadlineExceeded {
	// query was aborted.
	return nil, nil
}
tracing.Failure(span)
tracing.Error(span, err)
errmetrics.Inc(err)
this line triggers this code:
func (m *ErrMetrics) Inc(err error) {
	if err == gocql.ErrTimeoutNoResponse {
		m.cassErrTimeout.Inc()
	} else if err == gocql.ErrTooManyTimeouts {
		m.cassErrTooManyTimeouts.Inc()
	} else if err == gocql.ErrConnectionClosed {
		m.cassErrConnClosed.Inc()
	} else if err == gocql.ErrNoConnections {
		m.cassErrNoConns.Inc()
	} else if err == gocql.ErrUnavailable {
		m.cassErrUnavailable.Inc()
	} else if strings.HasPrefix(err.Error(), "Cannot achieve consistency level") {
		m.cassErrCannotAchieveConsistency.Inc()
	} else {
		m.cassErrOther.Inc()
	}
}
seems weird that various gocql timeouts result in err metrics being incremented, whereas context.DeadlineExceeded does not. I think we should also increment a timeout metric when context.DeadlineExceeded triggers
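for illustration, a sketch of what that could look like; the cassErrCtxDeadlineExceeded counter is hypothetical and does not exist in ErrMetrics today, the rest mirrors the code above. note it would only matter if the early return on context errors shown in the diff is also changed so that Inc actually gets called for those errors:

func (m *ErrMetrics) Inc(err error) {
	// hypothetical: count caller deadlines separately from gocql's own timeouts
	if err == context.DeadlineExceeded {
		m.cassErrCtxDeadlineExceeded.Inc() // this counter would have to be added to ErrMetrics
		return
	}
	if err == gocql.ErrTimeoutNoResponse {
		m.cassErrTimeout.Inc()
	} else if err == gocql.ErrTooManyTimeouts {
		m.cassErrTooManyTimeouts.Inc()
	} else if err == gocql.ErrConnectionClosed {
		m.cassErrConnClosed.Inc()
	} else if err == gocql.ErrNoConnections {
		m.cassErrNoConns.Inc()
	} else if err == gocql.ErrUnavailable {
		m.cassErrUnavailable.Inc()
	} else if strings.HasPrefix(err.Error(), "Cannot achieve consistency level") {
		m.cassErrCannotAchieveConsistency.Inc()
	} else {
		m.cassErrOther.Inc()
	}
}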
store_cassandra.go does not set a context Timeout on the read path.
So if context.Canceled or context.DeadlineExceeded is returned, it is because the caller wanted us to give up on the request. There was no issue preventing us from executing the request.
currently, we don't set a deadline on the request context anywhere, so context.DeadlineExceeded will never be returned. It is just here for completeness in case we one day add a deadline to the context.
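As a toy illustration of that distinction (none of this is metrictank code; search below just simulates a slow read): a caller-supplied deadline surfaces as context.DeadlineExceeded, which the read path treats as "the caller gave up" rather than as a cassandra error.

package main

import (
	"context"
	"fmt"
	"time"
)

// search stands in for the read path: it only returns an error if the
// caller's context is canceled or its deadline passes.
func search(ctx context.Context) error {
	select {
	case <-time.After(2 * time.Second): // pretend the cassandra query takes 2s
		return nil
	case <-ctx.Done():
		return ctx.Err()
	}
}

func main() {
	// hypothetical: if we one day put a deadline on the request context,
	// the read path would see context.DeadlineExceeded and treat it as the
	// caller aborting (return nil, nil), not as a cassandra error metric.
	ctx, cancel := context.WithTimeout(context.Background(), time.Second)
	defer cancel()
	fmt.Println(search(ctx)) // prints: context deadline exceeded
}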
wouldn't we want context.DeadlineExceeded to mark the spans as failed and the ErrMetrics to be incremented? it seems to me any such deadline being hit is a failure that should be reported
No. If DeadlineExceeded is reached it is because the caller set a timeout and they should emit their own error metrics and set the span to failed.
The error would have nothing to do with cassandra, so you should not increment the cassandraErr counters
fair enough
You need to leave a comment indicating the requested changes.
I did, github :(
just some minor changes needed, see comments.
also don't forget #778 (comment) thanks
Also return an error when a query fails rather than just silently ignoring it.
This is good to merge @Dieterbe
also:
* newer version of dep uses multi-line format
* it auto-added a bunch of constraints
* needed to pin gocql. it was tricky to determine which version our gocql is supposed to be at. the last update was #778 but we don't know what version of gocql that was exactly. None of the versions in gocql's last year of git history matches what we have in our vendor dir; in fact, the smallest diff with any version was still about 480 lines, so it looks like not all go files were copied over. however, it seems likely it would have been d9815cdf0ff24e2efa9b8062f4f94a6dd347ae51 because our vendor dir does include that change but not some of the later changes, and the time works out with that PR.
fixes #777